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1  Introduction 


The  problem  that  we  are  addressing  in  this  paper  is  the  following.  Suppose  we 
are  given  a  suite  of  heterogeneous  sensors  and  their  controlling  agents  which  are 
measuring  the  dynamics  of  a  plant.  For  example,  in  the  case  where  the  agents 
are  monitoring  a  battlefield,  the  agents  may  be  providing  radar  samples,  scout 
information,  or  satellite  information.  We  assume  that  each  sensor  agent  has  a 
model  of  the  plant  dynamics  relative  to  its  own  domain.  Moreover,  we  assvune 
that  the  sensor  agents  interact  with  each  other  in  real  time  via  a  communications 
network.  The  fundamental  problem  is  to  find  an  unbiased  estimate  of  the  plant 
state  given  that  the  data  available  is  dynamic,  noisy,  and  given  in  a  multiplicity 
of  representations.  The  approach  proposed  in  the  study  is  unique  because  it 
does  not  attempt  to  transform  the  data  to  a  common  representation.  Rather 
we  establish  a  firamework  which  we  call  the  Multiple  Agent  Hybrid  Estimation  » 
Architecture  (MAHEA),  see  Figure  1,  in  which  we  allow  heterogeneous  data 
to  fiow  between  individucJ  eigents  in  the  network  to  improve  their  individual 
estimates  of  the  current  plant  state. 

The  outline  of  this  paper  is  as  follows.  We  steirt  by  providing  a  brief  descrip¬ 
tion  of  a  MAHEA  agent,  the  basics  of  a  MAHEA  agent  models  of  the  plant  and 
the  procedure  it  uses  to  improve  its  plant  estimate  over  time.  Basically  each 
agent  of  MAHEA,  formulates  a  relaxed  variationid  optimization  problem  whose 
successful  resolution  produces  ein  estimate  of  the  plant.  Each  agent  operates  as 
a  real-time  theorem  prover  in  the  domain  of  relaxed  variational  theory  [31].  In 
section  2,  we  shall  give  more  details  on  the  formulation  of  the  relaxed  variationd 
optimization  problem  and  how  the  agent  solves  this  problem.  In  section  3,  we 
provide  a  general  description  of  how  the  agent  theorem  prover  operates.  We 
note  that  the  operation  of  the  agent  described  in  sections  2  and  3  are  essentially 
the  same  as  the  operation  of  an  agent  in  our  Multiple  Agent  Hybrid  Control 
Architecture,  see  [30,  22],  which  can  be  used  to  control  a  variety  of  processes 
including  automated  manufacturing  ,  multimedia  networks,  flexible  gun  tubes, 
and  flight  planning  for  missiles,  and  traffic  management  of  highways. 

The  new  results  of  this  paper  are  presented  in  sections  4  and  5.  In  section  4, 
we  shall  show  how  we  can  construct  a  special  optimization  criterion  for  the  pljint 
estimation  optimization  problem  which  we  call  the  Estimation  Lagrangian.  The 
type  of  Lagrangian  that  we  need  to  construct  is  special  for  the  plant  estimation 
problem  in  that  we  want  the  Lagrangian  to  be  0  at  each  point  where  the  agent 
has  reached  the  desired  estimate  of  the  plant  state.  Such  Lagrangians  are  not 
suitable  for  most  control  problems.  Finally  in  section  5,  we  shall  describe  our 
mechanism  to  solve  the  problems  of  agent  synchronization  and  how  agents  with 
different  models  can  produce  coherent  (synchronous)  and  compatible  (common 
view)  estimates  of  a  plant.  Our  solution  to  these  problems  uses  the  Noether 
invariance  conditions  in  a  novel  way. 
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Framework 


2  The  Multiple  Agent  Hybrid  Estimation  Ar¬ 
chitecture 

In  this  section,  we  describe  the  main  operational  and  functional  characteristics 
an  agent  in  a  MAHEA  network.  As  we  mentioned  in  the  introduction,  our 
Multiple  Agent  Hybrid  Estimation  Architecture  is  implemented  as  a  distributed 
system  composed  of  agents  and  a  communication  network  which  we  call  the 
logic  communication  network.  The  architecture  realizing  this  system  operates 
as  an  on-line  distributed  theorem  prover.  At  any  update  time,  each  antive 
agent  generates  estimation  actions  as  a  side  effect  of  proving  an  existentially 
quantified  subtheorem  (lemma)  which  encodes  the  model  of  the  plant  as  viewed 
by  the  agent.  The  conjunction  of  lemmas  at  each  instant  of  time,  encodes  the 
desired  behavior  of  the  entire  network.  The  niunber  of  agents  in  the  network  is 
variable.  That  is,  the  system  can  spawn  new  agents  and  deactivate  agents  as  a 
function  of  system  demand,  see  Figure  3.  Each  agent  of  MAHEA,  consists  of 
five  modules:  a  Planner,  a  Dynamic  Knowledge  Base,  a  Deductive  Inferencer, 
an  Adapter  and  a  Knowledge  Decoder.  We  briefly  overview  the  functionality  of 
an  agent  in  terms  of  its  modules  . 

The  basic  architecture  of  an  estimation  agent  is  pictured  in  Figure  2.  The 
agent  consists  of  5  modules  with  the  following  functionality: 

1.  Planner  The  Planner  constructs  and  repairs  the  agent  state  estimation 
optimization  criteria  which  we  refer  to  as  the  Estimation  Lagrangian  as¬ 
sociated  with  the  agent.  In  particular,  the  Planner  generates  a  statement 
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representing  the  desired  model  of  the  estimation  system  as  an  existentially 
quantified  logic  expression  herein  referred  to  as  the  Estimation  Statement. 

2.  Inferencer  The  Inferencer  determines  whether  there  is  a  state  estimate 
for  the  agent’s  relaxed  variational  state  estimation  problem  which  is  a 
near  optimal  solution  where  the  agent’s  Estimation  Lagrangian  is  used  as 
a  cost  function.  If  there  is  such  a  solution,  the  agent  infers  a  near  optimal 
estimation  and  sends  data  to  the  other  agents.  Otherwise  it  infers  failure 
terms  and  a  new  state  for  the  agent  and  reports  the  failure  to  the  other 
agents.  In  particular,  the  Inferencer  determines  whether  the  Estimation 
Statement  is  a  theorem  in  the  theory  currently  active  in  the  knowledge 
base.  If  the  Estimation  Statement  logically  follows  from  the  current  status 
of  the  Knowledge  Base,  the  inferencer  generates,  as  a  side  effect  of  proving  ^ 
this  Estimation  Statement  to  be  true,  the  current  state  estimate  of  the 
plant.  If  the  Estimation  Statement  does  not  logically  follow  from  the 
current  status  of  the  Knowledge  Base,  that  is,  the  desired  behavior  is  not 
realizable,  the  inferencer  transmits  the  failed  terms  to  the  Adapter  module 
for  replacement  or  modification. 

3.  Adapter  The  Adapter  repairs  failure  terms  and  constructs  correction 
terms. 

4.  Knowledge  Base  The  Knowledge  Base  stores  and  updates  the  agent’s 
plant  model  and  constraints.  The  Knowledge  Base  also  stores  the  require¬ 
ments  of  operations  or  processes  within  the  scope  of  the  agent’s  estimation 
problem.  It  also  encodes  system  constraints,  inter-agent  protocols  and 
constraints,  sensory  data,  operational  and  logic  principles  and  a  set  of 
primitive  inference  operations  defined  in  the  domain  of  equational  terms. 

5.  Knowledge  Decoder  The  Knowledge  Decoder  receives  and  translates 
the  other  agent’s  data. 

To  better  understand  how  these  five  modules  function,  we  first  need  to  dis¬ 
cuss  the  basic  elements  of  an  agent’s  model  and  how  it  behaves.  We  will  discuss 
this  model  in  the  next  section  and  we  will  then  follow  with  a  more  detailed 
discussion  of  the  five  modules  of  a  MAHEA  agent. 

2.1  An  MAHEA  Agent’s  Model 

In  general,  a  hybrid  system  has  a  hybrid  state,  the  simultaneous  dynamical 
state  of  all  plants  and  digital  control  devices.  Properly  construed,  the  hybrid 
states  will  form  a  differentiable  manifold  which  we  call  the  carrier  manifold  of 
the  system.  To  incorporate  the  digital  states  as  certain  coordinates  of  points  of 
the  carrier  manifold,  we  “continualize”  the  digital  states.  That  is,  we  view  the 
digital  states  as  finite,  real-valued,  piecewise-constcuit  functions  of  continuous 
time  and  then  we  take  smooth  approximations  to  them.  This  also  sJlows  us  to 
consider  logical  and  differential  or  variational  constraints  on  the  same  footing, 
each  restricting  the  points  allowed  on  the  carrier  manifold.  In  fact,  all  logical 
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MAHEA  Sensor  Agent 


Goal  Agent  State  Estimate 


Figure  2;  MAHEA  Agent  Architecture 


or  discontinuous  features  can  be  continualized  without  practical  side-effects. 
This  is  physically  correct  since  for  any  semiconductor  chip  used  in  an  analog 
device,  the  zeros  and  ones  are  really  just  abbreviations  for  sensor  readings  of 
the  continuous  state  of  the  chip.  Every  constraint  of  the  system,  digital  or 
continuous,  is  incorporated  into  the  definition  of  what  points  are  on  the  carrier 
mainifold,  Lagrange  constraints  are  regarded  as  part  of  the  definition  of  the 
manifold  as  well,  being  restrictions  on  what  points  are  on  the  manifold. 

More  specifically,  let  A*,  i  =  l,...,iV’(t)  denote  the  agents  active  at  the 
current  time  i.  In  our  model,  t  takes  values  on  the  real  line.  At  each  time  t,  the 
status  of  each  agent  in  the  network  is  given  by  a  point  in  a  locally  differentiable 
manifold  M  [21].  The  Estimation  Lagrangian  Li  of  an  active  agent  Ai  is  given 
by  a  continuous  function, 

Li  *  M  X  r  — >•  (1) 

where  T  is  the  real  line  (time  space)  and  is  the  positive  real  line.  A  point  p 
in  the  manifold  M  is  represented  by  a  data  structure  of  the  form: 

p(idjproc(proc-data)jestimate( estimation^daia)fin(synch^data)f  mp(mult^data))  (2) 

Here  id  is  an  identifier  taking  values  in  a  finite  set  ID,  procQ  is  a  relation 
characterizing  plant  processes  status  which  depends  on  a  list  of  parameters 
labeled  proc^data,  whose  parameters  define  the  operational,  load,  and  timing 
cheuracteristics  of  the  process  involved.  The  relation  estimate  captures  attributes 
of  the  plant  being  represented  which  depends  on  a  list  of  parameters  labeled 
estimation^data  whose  parameters,  among  other  things,  characterize  various 
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constraints  of  the  plant  representation  of  an  agent  at  a  level  of  abstraction 
compatible  with  the  logic  communication  network.  The  relation  m()  carries 
syndironization  information  of  the  logic  communication  network.  This  includes 
information  such  as  priority  level,  connectivity  and  time  constants.  Finally,  the 
relation  mp()  carries  multiplicity  information,  that  is,  it  represents  the  level  of 
network  usability  at  this  point.  The  associated  parameter  list,  mult^data,  is 
composed  of  statistical  parameters  reflecting  the  logic  network’s  load. 

Rom  an  agent’s  point  of  view,  the  dynamics  of  the  plant  is  characterized 
by  certain  trajectories  on  the  manifold  M.  These  trajectories  represent  the 
agent  estimate  of  the  state  of  the  plant  plus  the  flow  of  information  through  the 
network  and  its  status.  Specifically,  we  need  to  define  two  items: 

(i)  The  Estimation  Lagrangian  functions  : 

{I,(p,t):teJ(t)}  (3) 

where  I{t)  is  the  set  of  active  agents  at  time  t  and 

(ii)  the  actions  or  estimates  issued  by  the  agents. 

We  will  see  shortly  that  these  actions  are  implemented  as  infinitesimal  trans¬ 
formations  defined  in  M,  The  general  structure  of  an  Estimation  Lagrangian 
function  in  (3)  for  an  active  agent  i  at  time  t  is  given  in  (4)  below: 


Li(p,t)  =  Fi{Ui,L,ai)(p,t)  (4) 

where  Fi  is  a  smooth  function,  L  is  the  vector  of  Estimation  Lagrangian  func¬ 
tions,  Ui  is  the  state  estimation  error  function,  and  a*  is  the  command  action 
issued  by  the  i-th  agent.  We  will  devote  the  rest  of  this  subsection  to  charac¬ 
terizing  this  model. 

We  start  with  a  discussion  of  the  main  characteristics  of  the  manifold  M. 
In  general  a  manifold  M  is  a  topological  space  (with  topology  0  )  composed  of 
three  items: 

(a)  A  set  of  points  which  in  our  case  will  be  of  the  form  of  (2). 

(b)  A  countable  family  of  open  subsets  of  M,  Ui  such  that 

\JUi  =  M. 

i 

(c)  A  family  of  smooth  homeomorphisms,{</>i  :  <pi  :Ui-^  K},  where  for  each  j, 

Vj  is  an  open  set  in  The  sets  Ui  are  referred  to  in  the  literature  as 
coordinate  neighborhoods  or  charts.  For  each  chart  Uj  the  corresponding 
function  (pi  is  referred  to  as  its  coordinate  chart. 

The  coordinate  chart  functions  satisfy  the  following  additional  condition: 

Given  any  charts  17*  and  Uj  such  that  UiOUj  ^  0,  the  function 
(pi  o  (pj^  :  (pj{Ui  n  Uj)  (pj{Ui  n  Uj)  is  smooth. 
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In  the  literature,  one  usually  finds  an  additional  property,  which  is  the  Haus- 
dorff  property  in  the  definition  of  manifolds  [30].  Since  this  property  does  not 
hold  in  our  application  we  will  not  discuss  it. 

Now  we  proceed  to  customize  the  generic  definition  of  the  manifold  to  our 
application.  We  start  with  the  topology  ©  associated  with  M.  We  note  that  the 
points  of  M  have  a  definite  structure,  see  (2),  whose  structure  is  characterized 
by  the  values,  or  more  precisely  by  intervals  of  values,  of  the  parameters  in  the 
lists  proc^datay  estimation^datay  synch^data  and  mulLdata,  The  number  of  these 
parameters  equals  k.  The  knowledge  about  these  parameters  is  incorporated 
into  the  model  by  defining  a  finite  topology  on  [3]. 

The  open  sets  in  f2  are  constructed  from  the  clauses  encoding  what  we  know 
about  the  parameters.  The  topology  0  of  M  is  defined  in  terms  of  fi  as  follows: 
For  each  open  set  W  in  Q,  such  that  W  C  V}  C  i?*',  we  require  that  the  set  ^ 
(f>J^{W)  be  in  0.  The  sets  constructed  in  this  way  form  a  basis  for  ©  so  that 
a  set  C/  C  M  is  open  if  and  only  if  for  each  p  €  C/,  there  is  j  and  an  open  set 
W  €  n  such  that  W  C  y,  and  p  €  <t>jHW)^ 

To  characterize  the  actions  cdmmanded  by  a  MAHEA  agent  we  need  to 
introduce  the  concept  of  derivations  on  M.  Let  Fp  be  the  space  of  real  valued 
smooth  functions  /  defined  in  a  neighborhood  a  point  p  in  M .  Let  /  and  g  be 
functions  in  Fp.  A  derivation  v  of  Fp  is  a  map 

V  :  Fp  — >  Fp 

that  satisfies  the  following  two  properties: 

v{f  +  g)(p)  =  W)  +  v{g))(p)  (Linearity)  (5) 

v{f  ■  9)(p)  =  {v{f)  ■  g  +  f  ■  v{g))(p)  (  Leibniz  Rule)  (6) 

Derivations  define  vector  fields  on  M  and  a  class  of  associated  curves  called 
integral  ciirves  [22].  Suppose  that  C  is  a  smooth  curve  on  M  parameterized  by 
%l)  :  I  M  where  /  is  a  subinterval  of  R,  In  local  coordinates,  p  =  (p^,  ..mP^)j 
C  is  given  by  k  smooth  functions  ^{t)  =  whose  derivative 

with  respect  to  t  is  denoted  by  ^(t)  =  We  introduce  an 

equivalence  relation  on  curves  in  M  as  the  basis  of  the  definition  of  tangent 
vectors  at  a  point  in  M  [13].  Two  curves  and  ^2(f)  passing  through  a 

point  p  are  said  to  be  equivalent  at  p  (notation:  ^i(t)  '02 (t)),  if  there  exists 

Ti ,  r2  €  /  such  that 

01  (ti)  =02(r2)  =p  (7) 

0i(n)  =  02(r2).  (8) 

Clearly,  ^  defines  an  equivalence  relation  on  the  class  of  curves  in  M  passing 
through  p.  Let  [0]  be  the  equivalence  class  containing  0.  A  tangent  vector  to 
[0]  is  a  derivation,  t;|p,  such  that  in  local  coordinates  (p^, . . .  ,p^),  it  satisfies  the 
condition  that  given  any  smooth  function  f  \  M  Ry 

f. 

v\Af)iP)='E^ii)^  (9) 

i=o 
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where  p  =  The  set  of  tangent  vectors  associated  with  all  the  equivalence 
classes  at  p  defines  a  vector  space  called  the  tangent  vector  space  at  p,  denoted 
by  TMp,  The  set  of  tangent  spaces  associated  with  points  in  M  can  be  “glued” 
together  to  form  a  manifold  called  the  tangent  bundle  which  is  denoted  by  TM. 

TM=  \J  TMp 

p^M 

For  our  purposes,  it  is  important  to  specify  explicitly  how  this  gluing  is  imple¬ 
mented.  This  will  be  explained  below  after  we  introduce  the  concept  of  a  vector 
field  and  discuss  its  relevance  in  the  model. 

A  vector  field  on  M  is  an  assignment  of  a  derivation  u|p  to  each  point  p  of 
M  which  varies  smoothly  from  point  to  point.  That  is,  if  p  =  (p^,...,p*)  are 
local  coordinates,  then  we  can  always  write  v\p  in  the  form 

(10) 

Then  u  is  a  vector  field  if  the  coordinate  functions  Xi  are  smooth. 

Comparing  (9)  and  (10)  we  see  that  if  is  a  parameterized  curve  in  M  whose 
tangent  vector  at  any  point  coincides  with  the  value  of  u  at  a  point  p  =  7p{t), 
then  in  the  local  coordinates  p  =  (t), . . . ,  we  must  have 

^{t)  =  X^ip)  for  ;  =  1, . . . ,  k,  (11) 

In  our  application,  each  command  issued  by  the  MAHEA  agent  is  imple¬ 
mented  as  a  vector  field  in  Af .  Each  agent  constructs  its  command  field  as  a 
combination  of  ‘primitive’  predefined  vector  fields.  Since  the  chosen  topology 
for  M,  0,  is  not  metrizable,  we  cannot  guarantee  a  unique  solution  to  (11)  in 
the  classical  sense  for  a  given  initial  condition.  However,  they  have  solutions  in 
a  class  of  continuous  trajectories  in  M  called  relaxed  curves  [26].  In  this  class, 
the  solutions  to  (11)  are  unique.  We  discuss  the  basic  characteristics  of  relaxed 
curves  as  they  apply  to  our  process  control  formulation  and  implementation 
in  Section  3.  Next,  we  describe  some  of  their  properties  as  they  relate  to  our 
plant  model  and  estimation  process.  For  this  objective,  we  need  to  introduce 
the  concept  of  flows  in  M, 

If  u  is  a  vector  field,  any  parameterized  curve  passing  through  a  point  p  in 
M  is  called  an  integral  curve  associated  with  u,  if  in  local  coordinates  (11)  holds. 
An  integral  curve  associated  with  a  field  v,  denoted  by  ^(t,p)  is  termed  the  flow 
generated  by  v  if  it  satisfies  the  following  properties: 

^{t,  ®(r,p))  =  $(t  -h  r,p)  (semigroup  property)  (12) 

^(0>P)  =P  (initial  condition) 

and 

d 

— ’^(t,p)  =  V  (flow  generation) 
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Flow  associated  with  actions 
of  agent i 


^i,n  j“  ^  ^1.02"  ^u»2  ^  ^103  ®ija3  (P**)  A 


Figure  3:  Conceptual  illustration  of  agent  action  schedule 


Now  we  are  ready  to  customize  these  concepts  for  our  model.  Suppose  that 
a  MAHEA  agent  Ai  is  active.  Let  A  >  0  be  the  width  of  the  current  decision 
interval,  A).  Let  Ui(p,t)  be  the  state  estimation  error  at  the  beginning  of 
the  interval.  Agent  Ai  has  a  set  of  primitive  actions: 

{vij  :  j  =  1, . . . , Hi  where  Vij\p  6  TMp  for  each  p  E  M}  (13) 

During  the  interval  +  A),  agent  Ai  schedules  one  or  more  of  these  actions 
to  produce  a  flow  which  will  reduce  the  state  estimation  error.  In  particular, 
Ai  determines  the  fraction  OLij{p,t)  of  A  that  action  Vij  must  be  executed  as  a 
function  of  the  current  estimation  requests  5r,i(f  ,p)  and  the  vector  of  Estimation 
Lagrangians  L{p,t)  =  (Li(p,t),.  ..,L7v-(e))  of  the  active  agents  in  the  MAHEA 
network.  Figure  3  conceptually  illustrates  a  schedule  of  actions  involving  three 
primitives.  We  will  use  this  example  as  means  for  describing  the  derivation  of 
our  model.  The  general  case  is  similar. 

The  flow  associated  with  the  schedule  of  Figure  3  can  be  computed  from 
the  flows  associated  with  each  of  the  actions: 


(t,p)  if  t  <  t  <  t  +  A<,„1 

0>  _  J  (t,p))  if  t  +  Ai,„,  <T<t  +  Ai,„,  +  Ai,„j 

k  if  f  +  Ai,„,  +  Aj,„j  <t  +  Ai,„j  +  Ai,„j  +  Ai,„3 

(14) 

where  A  =  Ai,„j  +  Ai,„j  +  Ai,ns  and  aj,„i  +  ai,„j  +  ai,„3  =  1.  We  note  that 
the  flow  given  by  (14)  characterizes  the  evolution  of  the  process  as  viewed 
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by  agent  A*.  The  vector  field  Vi\p  associated  with  the  flow  is  obtained  by 
differentiation  and  the  third  identity  in  (12).  This  vector  field  applied  at  p  is 
proportional  to 

^ilp  “  bi,ni  1  »  ^1,113]]  (15) 

where  is  the  Lie  bracket  due  to  the  parallelogram  law,  see  [34].  The  Lie 

bracket  is  defined  as  follows:  Let  v  and  lu,  be  derivations  on  M  and  let  /  : 

M  iZ  be  any  real  valued  smooth  functions.  The  Lie  bracket  of  v  and  w  is  the 
derivation  defined  by 

[v,w]{f)  =  v{w{f))  -w{v{f)), 

see  [9]. 

Thus  the  composite  action  Vi\p  generated  by  the  z-th  agent  to  improve  the 
state  estimation  is  a  composition  of  the  form  of  (15).  Moreover  from  a  version  * 
of  the  Chattering  lemma  and  duality  [18],  we  can  show  that  this  action  can  be 
expressed  as  a  linear  combination  of  the  primitive  actions  available  to  the  agent 
as  follows. 

[Vt,n2j'^i,ns]] 

J 

with  the  coefficients  jj  determined  by  the  fraction  of  time  that  each  primitive 
action  Vij  is  used  by  agent  i. 

The  effect  of  the  field  defined  by  the  composite  action  Uijp  on  any  smooth 
function  (equivalent  function  class)  is  computed  by  expanding  the  right  hand 
side  of  (14)  in  a  Lie-Taylor  series  [9].  In  particular,  we  can  express  the  change 
in  the  state  estimation  error  Ui  due  to  the  flow  over  the  interval  A  in  terms  of 
Vi\p.  The  evolution  of  the  state  estimation  error  Ui  over  the  interval  starting  at 
point  p  and  ending  at  a  point  p"  is  given  by 

Ui{t  +  A,p")  =  Ui{t,  ^i{t  +  A,p))  (17) 

Expanding  the  right  hand  side  of  (17)  in  a  Lie-Taylor  series  around  (t,p), 
we  obtain, 

} 

where 

(«ilp(-))^  =  Vi\p  ^(ui|p(-))^"^)  (18) 

and 

(^dp)^/)  =  /•  for  all  / 

In  general,  the  series  in  the  right  hand  side  of  (18)  will  have  countable  many 
non-zero  terms.  In  our  case,  since  the  topology  of  M  is  finite  because  it  is 


=  (16) 
3 

=  1 
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generated  by  finitely  many  logic  clauses,  this  series  will  have  only  finitely  many 
non-zero  terms.  Intuitively,  this  is  so  because  in  computing  powers  of  deriva¬ 
tions  (i.e,,  limits  of  differences),  we  need  only  to  distinguish  among  different 
neighboring  points.  In  our  formulation  of  the  topology  of  M,  this  can  only  be 
imposed  by  the  information  in  the  clauses  of  the  agent’s  Knowledge  Base.  Since 
each  agent’s  knowledge  base  has  only  finitely  many  clauses,  there  is  a  term  in  the 
expansion  of  the  series  in  which  the  power  of  the  derivation  goes  to  zero.  This 
is  important  because  it  allows  the  series  in  the  right  hand  side  to  be  effectively 
generated  by  a  locally  finite  automaton.  We  will  expand  on  the  construction 
of  this  automaton  in  the  next  section  when  we  discuss  the  inference  procedure 
carried  out  by  each  agent. 

We  note  that  given  the  set  of  primitive  actions  available  to  each  agent,  the 
composite  action  is  determined  by  the  vector  of  fraction  functions  aj  .  We  will  » 
see  in  the  next  section  that  this  vector  is  inferred  by  each  agent  from  the  proof 
of  existence  of  solutions  of  an  optimization  problem. 

Now  we  can  write  the  specific  nature  of  the  model  formulated  in  expression 
(4).  At  time  t  and  at  point  p  e  M  the  estimation  error  function  of  agent  i  is 
given  by  : 

Ui(p,  t)  =  Ui{p,  t~)  +  Sr,i(p,  t)  +  Qi,kLk(p,  t~)  (19) 

k 

where  t”  is  the  end  point  of  the  previous  update  interval,  Sr,i  is  the  estimation 
request  function  to  agent  i,  and  Qi^k  is  a  multiplier  determining  the  required 
degree  of  accuracy  and  the  urgency  of  the  estimate  of  the  Agent  k  that  Agent 
i  requires.  This  allocation  is  determined  from  the  characteristics  of  the  process 
both  agents  are  estimating  and  from  the  process  description  encoded  in  the 
agent’s  knowledge  base.  The  actual  request  from  agent  k  to  agent  i  is  thus  the 
term,  The  information  sent  to  agent  i  by  agent  k  is  the  state 

estimation  function  Ljk(p,t~)  at  the  end  of  the  previous  interval.  Finally  the 
point  p  e  M  carries  the  current  estimate  of  the  process  monitored  by  the  agents 
appearing  in  (19).  Agent  k  thus  contribute  to  Agent  i’s  new  estimate  only  if 

Qi,k  ^  0* 

This  concludes  our  description  of  the  model.  For  space  considerations,  some 
details  have  been  left  out.  In  particular  those  related  to  the  strategy  for  activa¬ 
tion  and  deactivation  of  agents.  These  will  be  discussed  in  a  future  paper. 

3  The  Five  Modules 

In  each  agent  of  MAHEA,  the  Estimation  Statement  is  the  formulation  of  a  re¬ 
laxed  vaxiationeJ  optimization  problem  whose  successful  resolution  produces  an 
action  schedule  of  the  form  of  (15).  Each  agent  operates  as  a  real-time  theorem 
prover  in  the  domain  of  relaxed  variational  theory  [31].  A  customized  version 
of  this  theory,  enriched  with  elements  of  differential  geometry,  equational  logic 
and  automata  theory  provides  a  general  representation  for  the  dynamics,  con¬ 
straints,  requirements  and  logic  of  the  Estimation  Agent  network  .  We  devote 
the  rest  of  this  section  to  the  discussion  of  the  main  elements  of  this  theory  in 
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the  context  of  the  operational  features  of  MAHEA.  The  architecture  is  com¬ 
posed  of  two  items:  The  Control  Agent,  and  the  logic  communication  Network. 
These  items  are  illustrated  in  Figures  2  and  4  respectively.  We  will  discuss  them 
in  the  remaining  of  this  section. 


3.1  Architectural  Elements  of  a  control  agent: 

We  will  discuss  next  the  functionality  of  the  five  modules  of  a  control  agent. 
These  are:  the  Knowledge  Base,  the  Planner,  the  Inferencer,  the  Knowledge 
Decoder  and  the  Adapter. 


3.1.1.  Knowledge  Base:  The  Knowledge  Base  consists  of  a  set  of  equational 
first  order  logic  clauses  with  second  order  extensions.  The  syntax  of  clauses  is 
similar  to  the  ones  in  the  Prolog  language.  Each  clause  is  of  the  form 

Head  -f-  Body  (20) 

where  Head  is  a  functional  form,  p(a;i,  taking  values  in  the  binary  set 

[true,  false]  with  xi,  xa, . . . ,  a:n  variables  or  parameters  in  the  domain  M  of  the 
MAHEA  network.  The  symbol  ^  stands  for  logical  implication.  The  variables 
appearing  in  the  clause  head  are  assumed  to  be  universally  quantified.  The 
Body  of  a  clause  is  a  conjunction  of  one  or  more  logical  terms, 

ei  A  62  A  .  .  .  A  Cn  (21) 

where  A  is  the  logical  ‘and’.  Each  term  in  (21)  is  a  relational  form.  A  relational 
form  is  one  of  the  following:  an  equational  form,  an  inequational  form,  a  covering 
form,  or  a  clause  head.  The  logical  value  of  each  of  these  forms  is  either  true 
or  false.  A  relational  form  e*  is  true  precisely  at  the  set  of  tuples  of  values  Si 
of  the  domain  taken  by  the  variables  where  the  relational  form  is  satisfied  and 
is  false  for  the  complement  of  that  set.  Thus  for  ei  =  et(xi, . . .  ,Xn),  Si  is  the 
possibly  empty  subset  of  , 

Si  =  {(xi,...,Xn)  e  M”  :  ei(xi,...,i„)  =  true) 

so  that 

e<(xi, . . .  ,x„)  =  false  if  (xi, . . .  ,x„)  € 


The  generic  structure  of  a  relational  form  is  given  in  Table  1. 


Form 

Structure 

Meaning 

equational 
inequational 
covering 
clause  head 

u;(xi,...,Xn)  =  t)(Xi,...,X„) 
w{xi,...,Xn)  U(X1,...,X„) 

u;(xi,...,x„)  <  v(xi,...,Xn) 
q{Xi  )  -  •  •  1  Xn) 

equal 
not  equal 
partial  order 
recursion,chaining 

Table  1.  Structure  of  the  Relational  Form 
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0  Spawned  Agent 


Deactivated  Agent  Q  Active  Agent 


Deactivated  Link 


Active  Link 


Figure  4:  Network  of  Cooperating  Control  Agents 


In  Table  1,  w  and  v  are  polynomic  forms  with  respect  to  a  finite  set  of  op¬ 
erations  whose  definitional  and  property  axioms  are  included  in  the  Knowledge 
Base.  A  polynomic  form  u  is  an  object  of  the  form  v(xi , . . . ,  Xn)  = 
where  is  the  free  monoid  generated  by  the  variable  symbols  {xi, . . . ,  Xn}  un¬ 
der  juxtaposition.  The  term  is  called  the  coefficient  of  v  at  a;.  The 

coefficients  of  a  polynomic  form  v  take  values  in  the  domain  of  definition  of  the 
clauses.  The  domain  in  which  the  variables  in  a  clause  head  take  values  is  the 
mamifold  M  described  in  section  2.  The  logical  interpretation  of  (20)  and  (21)  is 
that  the  Head  is  true  if  the  conjunction  of  the  terms  of  Body  are  jointly  true  for 
instances  of  the  vcuriables  in  the  clause  head,  M  is  contained  in  the  Cartesian 
product  : 

MCGxSxXxA  (22) 

where  G  is  the  space  of  goeds,  S  is  the  space  of  sensory  data,  X  is  the  space  of 
estimation  states  and  A  is  the  space  of  estimation  actions.  These  were  described 
in  section  2.  G,  S,  X,  and  A  are  manifolds  themselves  whose  topological  struc¬ 
ture  are  defined  by  the  specification  clauses  in  the  Knowledge  Beise,  see  Figure 
5.  These  clauses,  which  are  application  dependent,  encode  the  requirements  on 
the  closed-loop  behavior  of  the  estimation  model  of  the  agent.  In  fact  the  closed 
loop  behavior,  which  we  will  define  later  in  this  section  in  terms  of  a  variational 
formulation,  is  characterized  by  continuous  ciirves  with  values  in  M .  This  con¬ 
tinuity  condition  is  central  because  it  is  equivalent  to  requiring  the  system  to 
look  for  actions  that  make  the  closed  loop  behavior  satisfy  the  requirements  of 
the  plant  model. 
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The  denotational  semantics  of  each  clause  in  the  knowledge  base  is  one  of 
the  following: 

1.  a  conservation  principle, 

2.  an  invariance  principle,  or 

3.  a  constraint  principle. 

Conservation  principles  are  one  or  more  clauses  about  the  balance  of  a  par¬ 
ticular  process  in  the  dynamics  of  the  system  or  the  computational  resources. 
For  instance,  equation  (19)  encoded  as  a  clause  expresses  the  conservation  of 
error  for  the  Agent  I’s  estimation  of  the  plant. 


con$evationjof^error(p,  t,  [Qi,k],  Sr.u  [Lk],  A,  Ui{t,p)) 

U,(t  +  2A,p")  =  ^  A 

i 

/*  encoding  of  equation  (13)  */ 


Ui{t  +  A,p')  =  Ui(t,p)  +  Sr,i(t,p)  +  ^  Qi,kLk{t,p,p)  A  (23) 

k 

/*  encoding  of  equation  (15)  */ 
process^evolution{pytyp'^)  A  /*  encoding  of  equation  (13)  */ 
consevationjofjerror{p",t  +  A,  5r,i,  [ijb],  A,  Ui{t  +  2A,p") 


Conservation  principles  always  involve  recursion  whose  scope  is  not  neces¬ 
sarily  a  single  clause,  as  in  the  example  above,  but  with  chaining  throughout 
several  clauses. 

Invariance  principles  are  one  or  more  clauses  establishing  constants  of  the 
evolution  of  agent’s  estimation  error  functions  in  a  general  sense.  These  prin¬ 
ciples  include  stationarity,  which  plays  a  pivotal  role  in  the  formulation  of  the 
theorems  proved  by  the  architecture,  and  geodesics.  For  example,  in  the  state 
estimation  of  multimedia  processes,  invariant  principles  specify  quality  response 
requirements.  That  is,  levels  of  performance  as  a  function  of  traffic  load  that  the 
system  must  satisfy.  The  importance  of  invariance  principles  lies  in  the  reference 
they  provide  for  the  detection  of  unexpected  events.  For  example,  in  the  state 
estimation  of  a  multimedia  process,  the  update  time  after  a  request  is  serviced 
is  constant,  under  normal  operating  conditions.  In  the  knowledge  base  of  each 
agent,  there  is  an  equational  clause  that  states  this  invariance  has  a  ground  form 
that  is  constant.  If  this  ground  form  does  not  evaluate  to  the  required  constant, 
then  the  agent  knows  that  there  is  a  deviation  from  normality. 

Constraint  principles  axe  clauses  representing  engineering  limits  to  actuators 
or  sensors  and,  most  importantly,  rules  of  engagement. 

The  clause  database  is  organized  in  a  nested  hierarchical  structure  illus¬ 
trated  in  Figure  5.  The  bottom  of  this  hierarchy  contains  the  equations  that 
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Model  Builder  Realization 


Dynamic  Estimation  Specifications 


Estimation  Performance  Specifications 


Generic 

Estimation 

Specifications 


Process 

Representation 


Goal 

Class 

Representation 


Laws  of  the  Variety 


Figure  5:  Knowledge  Base  Organization 


characterize  the  algebraic  structure  defining  the  terms  of  relational  forms,  i.e. 
an  algebraic  variety  [36]. 

At  the  next  level  of  the  hierarchy,  three  types  of  clauses  are  stored:  Generic 
Estimation  Specifications,  Battlefield  Representation  and  Goal  Class  Represen¬ 
tation. 

The  Generic  Estimation  Specifications  are  clauses  expressing  general  desired 
behavior  of  the  system.  They  include  statements  about  stability,  complexity  and 
robustness  that  are  generic  to  the  class  of  declarative  rational  controllers.  These 
specifications  are  written  by  constructing  clauses  that  combine  laws  of  the  kind 
which  use  the  Horn  clause  format  used  in  (23). 

The  Process  Representation  is  given  by  clauses  characterizing  the  dynamic 
behavior  and  structure  of  the  plant,  which  includes  sensors  and  actuators.  These 
clauses  are  written  as  conservation  principles  for  the  dynamic  behavior  and  as 
invariance  principles  for  the  structure.  As  for  the  Generic  Estimation  Specifi¬ 
cations,  they  are  constructed  by  combining  a  variety  of  laws  in  the  equationed 
Horn  clause  format. 

The  Goal  Class  Representation  contains  clauses  characterizing  sets  of  desir¬ 
able  operation  points  in  the  domain  (points  in  the  manifold  Af).  For  example, 
Goal  Class  clauses  could  specify  the  type  and  size  of  permitted  state  estimation 
errors.  These  clauses  are  expressed  as  soft  constraints;  that  is,  constraints  that 
can  be  violated  for  finite  intervals  of  time.  They  express  the  ultimate  purpose 
of  the  controller  but  not  its  behavior  over  time. 

The  next  level  of  the  hierarchy  involves  the  Estimation  Performance  Specifi¬ 
cations.  These  are  typically  problem-dependent  criteria  and  constraints.  They 
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are  written  in  equational  Horn  clause  format.  They  include  generic  constraints 
such  as  speed  and  time  of  response,  and  qualitative  properties  of  state  trajecto¬ 
ries  [31].  Dynamic  Estimation  Specifications  are  equational  Horn  clauses  whose 
bodies  are  modified  as  a  function  of  the  sensor  and  goal  commands. 

Finally,  Model  Builder  Realization  clauses  constitute  a  recipe  for  building  a 
procedural  model  (an  automaton)  for  generating  variable  instantiation  (unifica¬ 
tion)  and  for  theorem  proving. 


3.1.2.  The  Planner; 

The  function  of  the  theorem  Planner,  which  is  domain-specific,  is  to  generate, 
for  each  update  interval,  a  symbolic  statement  of  the  desired  behavior  of  the 
system,  as  viewed,  say  by  the  agent  j,  throughout  the  interval.  The  theorem 
statement  that  it  generates  has  the  following  form: 


Given  a  set  of  primitive  actions  there  exists  state  estimation  action 
schedule  Vi\p  of  the  form  (16)  and  a  fraction  function  differential 
da(-)  (Figure  3)  in  the  control  interval  \t,t  -1-  A)  such  that  da(-) 
minimizes  the  functional 

Li{9i{T,p),Vi\p{GiiT,p)))da{p,dr)  (24) 

subject  to  the  following  constraints: 


gi{Si,9i{t  +  A,p)) 


L  ‘ 


da(p,  dr) 


(local  goal  for  the  interval), 

=  Vi{p,t)  (25) 

(inter-agent  contraint,  see  (19)) 

and 


=  1 


In  (24),  Li  is  the  Estimation  Lagrangian  of  the  system  as  viewed  by  Agent 
i  for  the  current  interval  of  control  [t,t  -f  A).  This  function,  which  maps  the 
Cartesian  product  of  the  state  and  estimation  action  spaces  into  the  real  line 
with  the  topology  defined  by  the  clauses  in  the  knowledge  base,  captures  the 
dynamics,  constraints  and  requirements  of  the  system  as  viewed  by  agent  i 
The  Local  Estimation  Lagrangian  function  Li  is  a  continuous  projection  in  the 
topology  defined  by  the  knowledge  base  (see  [28])  in  the  coordinates  of  the  z-th 
agent  of  the  global  Lagrangian  function  L  that  characterizes  the  system  as  a 
whole.  In  (25),  p  represents  the  state  of  the  process  under  control  as  viewed 
by  the  agent  and  G*  is  the  parallel  transport  operator  bringing  the  goal  to  the 
current  interval.  The  operator  Gi  is  constructed  by  lifting  to  the  manifold  the 
composite  flow  (see  equation  (14)).  We  note  that  the  composite  flow  and  the 
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Figure  6:  Illustration  of  optimization 


action  schedule  are 'determined  once  the  fraction  function  is  known  and  that 
this  function  is  the  result  of  the  optimization  (24),  (25).  In  particular,  the 
action  schedule  is  constructed  as  a  linear  combination  of  primitive  actions  (see 
equation  (16)). 

The  term  da(*)  in  (24)  is  a  Radon  probability  measure  [32]  on  the  set  of 
primitive  estimation  actions  or  derivations  that  the  agent  can  execute  for  the 
interval  [t,t  *f  A).  It  measures,  for  the  interval,  the  percentage  of  time  to  be 
spent  in  each  of  the  primitive  derivations.  The  centrad  function  of  the  control 
agent  is  to  determine  this  mixture  of  actions  for  each  control  interval.  This 
function  is  carried  out  by  each  agent  by  inferring  from  the  current  status  of  the 
knowledge  base  whether  a  solution  of  the  optimization  problem  stated  by  the 
current  theorem  exists,  and,  if  so,  to  generate  corresponding  actions  and  state 
updates.  Figure  6  illustrates  the  relations  between  the  primitive  actions  and 
the  fraction  of  A  they  are  active  in  the  interval  [t,  i  4*  A). 

The  expressions  in  (25)  constitute  the  constraints  imposed  in  the  relaxed  op¬ 
timization  problem  solved  by  the  agent.  The  first  one  is  the  loceJ  goal  constraint 
expressing  the  general  value  of  the  state  at  the  end  of  the  current  interval.  The 
second  represents  the  constraints  imposed  on  the  agent  by  the  other  agents  in 
the  network.  Finally,  the  third  one  indicates  that  da{-)  is  a  probability  measure. 
Under  relaxation  and  with  the  appropriate  selection  of  the  domain,  see  [19],  the 
optimization  problem  stated  in  (24)  and  (25)  is  a  convex  optimization  problem. 
This  is  important  because  it  guarantees  that  if  a  solution  exists,  it  is  unique 
up  to  probability,  and  also,  it  guarantees  the  computational  effectiveness  of  the 
inference  method  that  the  agent  uses  for  proving  the  theorem. 
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The  construction  of  the  theorem  statement  given  by  (24)  and  (25)  is  the  cen¬ 
tral  task  carried  out  in  the  Planner.  It  characterizes  the  desired  behavior  of  the 
process  as  viewed  by  the  agent  in  the  current  interval  so  that  its  requirements 
are  satisfied  and  the  system  “moves”  towards  its  goal  in  an  optimal  manner. 


3.1,3.  Adapter; 

The  function  under  the  integral  in  (24)  includes  a  term,  referred  to  as  the 
“catch-air  potential,  which  is  not  associated  with  any  clause  in  the  Knowledge 
Base.  Its  function  is  to  measure  unmodeled  dynamic  events.  This  monitoring 
function  is  carried  out  by  the  Adapter  which  implements  a  generic  conamutator 
principle  similar  to  the  Lie  bracket  discussed  in  section  2,  see  (24).  Under  this 
principle,  if  the  value  of  the  catch-all  potential  is  empty,  the  current  theorem 
statement  adequately  models  the  statxis  of  the  system.  On  the  other  hand,  if  the 
theorem  fails,  meaning  that  there  is  a  mismatch  between  the  current  statement 
of  the  theorem  and  system  status,  the  catch-all  potential  carries  the  equational 
terms  of  the  theorem  that  caused  the  failure.  These  terms  are  negated  and 
conjuncted  together  by  the  Inferencer  according  to  the  commutation  principle 
(which  is  itself  defined  by  equational  clauses  in  the  Knowledge  Base)  and  stored 
in  the  Knowledge  Base  as  an  adaptation  dynamic  clause.  The  Adapter  then 
generates  a  potentid  symbol,  which  is  characterized  by  the  adaptation  clause 
and  corresponding  tuning  constraints.  This  potential  is  added  to  criterion  for 
the  theorem  cheiracterizing  the  interval. 

The  new  potential  symbol  and  tuning  constraints  are  sent  to  the  Planner 
which  generates  a  modified  Estimation  Lagrangian  for  the  agent  and  goal  con¬ 
straint.  The  new  theorem,  thus  constructed,  represents  adapted  behavior  of  the 
system.  This  is  the  essence  of  reactive  structural  adaptation  in  the  our  model 

At  this  point,  we  pause  in  our  description  to  address  the  issue  of  robustness. 
To  a  large  extent,  the  adapter  mechanism  of  each  controller  agent  provides  the 
system  with  a  generic  and  computationally  effective  means  to  recover  firom  fail¬ 
ures  or  unpredictable  events.  Theorem  failures  are  symptoms  of  mismatches 
between  what  the  agent  thinks  the  system  looks  like  and  what  it  really  looks 
like.  The  adaptation  clause  incorporates  knowledge  into  the  agent’s  Knowledge 
Base  which  represents  a  recovery  strategy.  The  Inferencer,  discussed  next,  ef¬ 
fects  this  strategy  as  part  of  its  normal  operation. 


3.1.4.  Inferencer: 

The  Inferencer  is  an  on-line  equational  theorem  prover.  The  class  of  theorems 
it  can  prove  are  represented  by  statements  of  the  form  of  (20)  and  (21),  expressed 
by  an  existentially  quantified  conjunction  of  equational  terms  of  the  form: 

3Z{W^{Z,p)  reli  ViiZ,p)  A  ...  A  Wn{Z,p)  rek  V„iZ,p))  (26) 

where  Z  is  a  tuple  of  variables  each  talking  values  in  the  domain  P,  p  is  a  list  of 
parameters  in  £),  and  {Wi,  Vi]  are  polynomial  terms  in  the  semiring  polynomial 
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algebra: 


D{n)  (27) 

with  D  =  {Dj  (+,  1, 0))  a  semiring  algebra  with  additive  unit  0  and  multiplica¬ 
tive  unit  1.  In  (26),  re/i,  i  =  1, . . .  ,n  are  binary  relations  on  the  polynomial 
algebra.  Each  reZ*  can  be  either  an  equality  relation  (=  ),  inequality  relation 
(y^ ),  or  a  partial  order  relation  (<).  In  a  given  theorem,  more  than  one  partial 
order  relation  may  appear.  In  each  theorem,  at  least  one  of  the  terms  is  a  partial 
order  relation  that  defines  a  complete  lattice  on  the  algebra;  that  corresponds  to 
the  optimization  problem.  This  lattice  has  a  minimum  element  if  the  optimiza¬ 
tion  problem  has  a  minimum.  Given  a  theorem  statement  of  the  form  of  (26) 
and  a  knowledge  base  of  equational  clauses,  the  inferencer  determines  whether 
the  statement  logically  follows  from  the  clauses  in  the  Knowledge  Base,  and  if 
so,  as  a  side  effect  of  the  proof,  generates  a  non-empty  subset  of  tuples  with  * 
entries  in  M  giving  values  to  Z.  These  entries  determine  the  agent’s  actions. 
Thus,  a  side  effect  is  instantiation  of  the  agent’s  decision  variables.  In  (27), 
n  is  a  set  of  primitive  unary  operations,  {vi},  the  infinitesimal  primitive  fields 
defined  in  section  2.  Each  t;,  maps  the  semiring  algebra,  whose  members  are 
power  series  involving  the  composition  of  operators,  on  Z  to  itself: 

T;i  :  D{{Z))  D{{Z))  (28) 

These  operators  are  characterized  by  axioms  in  the  Knowledge  Base,  and  are 
process  dependent.  In  formal  logic,  the  implemented  inference  principle  can 
be  stated  as  follows:  Let  S  be  the  set  of  clauses  in  the  Knowledge  Base.  Let 
=>  represent  implication.  Proving  the  theorem  means  to  show  that  it  logically 
follows  from  S,  i.e. 

E  =»  theorem.  (29) 

The  proof  is  accomplished  by  sequences  of  applications  of  the  following  in¬ 
ference  axioms: 

(i)  equality  axioms 

(ii)  inequality  axioms 

(iii)  partial  order  axioms 

(iv)  compatibility  axioms 

(v)  convergence  axioms 

(vi)  knowledge  base  axioms 

(vii)  limit  axioms 

The  specifics  of  these  inference  axioms  can  be  found  in  [12]  where  it  is  shown 
that  each  of  the  inference  principles  can  be  expressed  as  an  operator  on  the 
Cartesian  product: 

D{{W))  X  D{{W))  (30) 
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Inference  Rules:  ,  b|  ,  C|  ,  d|  ,  ej 

Figure  7;  Conceptual  Structure  of  the  Proof  Automaton 


Each  inference  operator  transforms  a  relational  term  into  another  relational 
term.  The  inferencer  applies  sequences  of  inference  operators  on  the  equational 
terms  of  the  theorem  imtil  these  terms  are  reduced  to  either  a  set  of  ground 
equations  of  the  form  of  (31)  or  it  determines  that  no  such  ground  form  exists. 

Zi  =  ctijOLi  €  D  (31) 

The  mechemism  by  which  the  inferencer  carries  out  the  procedure  described 
above  is  by  building  a  procedure  for  variable  goal  instantiation:  a  locally  finite 
automaton.  We  refer  to  this  automaton  as  the  Proof  Automaton.  This  impor¬ 
tant  feature  is  unique  to  our  approach.  The  proof  procedure  is  customized  to 
the  particular  theorem  statement  and  Knowledge  Base  instance  it  is  currently 
handling.  The  structure  of  the  proof  automaton  generated  by  the  inferencer  is 
illustrated  in  Figure  7. 

In  Figure  7,  the  initial  state  represents  the  equations  associated  with  the 
theorem.  In  general,  each  state  corresponds  to  a  derived  equational  form  of  the 
theorem  through  the  application  of  a  chain  of  inference  operators  to  the  initial 
state  that  is  represented  by  the  path. 

Each  edge  in  the  automaton  corresponds  to  one  of  the  possible  inferences. 
A  state  is  terminal  if  its  equational  form  is  a  tautology,  or  it  corresponds  to 
a  canonical  form  whose  solution  form  is  stored  in  the  Knowledge  Base,  In 
traversing  the  automaton  state  graph,  values  or  expressions  axe  assigned  to 
the  variables.  In  a  terminal  state,  the  equational  terms  are  all  ground  states 
(see  (31)).  K  the  automaton  contains  at  least  one  path  starting  in  the  initial 
state  and  ending  in  a  terminal  state,  then  the  theorem  is  true  with  respect  to 
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Figure  8:  Summary  of  Inferencer  Procedure 


the  given  Knowledge  Base  and  the  resulting  variable  instantiation  is  a  valid 
one.  1£  this  is  not  the  case,  the  theorem  is  false.  The  function  of  the  complete 
partial  order  term  present  in  the  conjimction  of  each  theorem  provable  by  the 
inferencer  is  to  provide  a  guide  for  constructing  the  proof  automaton.  This  is 
done  by  transforming  the  equational  terms  of  the  theorem  into  a  canonical  fixed 
point  equation,  called  the  Kleene-Schutzenberger  Equation  (KSE)  [12],  which 
constitutes  a  blueprint  for  the  construction  of  the  proof  automaton.  This  fixed 
point  coincides  with  the  solution  of  the  optimization  problem  formulated  in  (24) 
(25),  when  it  has  a  solution.  The  general  form  of  KSE  is  : 

Z  =  J5(p)-Z  +  r(p)  (32) 

In  (32),  is  a  square  matrix,  with  each  entry  a  rational  form  constructed  from 
the  basis  of  inference  operators  described  above,  and  T  is  a  vector  of  equational 
forms  from  the  Knowledge  Base.  Each  non-empty  entry,  Eij,  in  E  corresponds 
to  the  edge  in  the  proof  automaton  connecting  states  i  and  j.  The  binary 
operator  between  E{p)  and  Z  represents  the  "apply  inference  to”  operator. 
Terminal  states  are  determined  by  the  non-empty  terms  of  T.  The  p  terms  are 
custom  parameter  values  in  the  inference  operator  terms  in  £?(•). 

A  summary  of  the  procedure  executed  by  the  inferencer  is  presented  in  Figure 

8. 

We  note  that  the  construction  of  the  automaton  is  carried  out  from  the 
canonical  equation  and  not  by  a  non-deterministic  application  of  the  inference 
rules.  This  approach  reduces  the  computational  complexity  of  the  canonical 
equation  (low  polynomic)  and  is  far  better  than  applying  the  inference  rules 
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directly  (exponential). 

The  automaton  is  simulated  to  generate  instances  of  the  state,  action  and 
evaluation  variables  using  an  automaton  decomposition  procedure  [33]  which 
requires  nlog2n  time,  where  n  #  of  states  of  the  automaton.  This  “divide  and 
conquer”  procedure  implements  the  recursive  decomposition  of  the  automaton 
into  a  cascade  of  parallel  unitary  (one  initial  and  one  terminal  state)  automata. 
Eajch  of  the  resulting  automata  on  this  decomposition  is  executed  independently 
of  the  others.  The  behavior  of  the  resulting  network  of  automata  is  identical 
with  the  behavior  obtained  from  the  original  automaton,  but  with  feasible  time 
complexity. 

The  inferencer  for  each  Estimation  Agent  fulfills  two  functions:  to  generate 
a  proof  for  the  system  behavior  theorem  of  each  agent  generated  by  the  Planner 
(equations  (24)  and  (25))  and  to  function  as  the  central  element  in  the  Knowl¬ 
edge  Decoder.  We  now  describe  its  function  for  proving  the  behavior  theorem.  * 
Later,  we  will  overview  its  function  as  part  of  the  Knowledge  Decoder.To  show 
how  the  inferencer  is  used  to  prove  the  Planner  theorem,  (24),  (25),  first,  we 
show  how  this  theorem  is  transformed  into  a  pattern  of  the  form  of  (26).  Since 
(24),  (25)  formulates  a  convex  optimization  problem,  a  necessary  and  sufficient 
condition  for  optimality  is  provided  by  the  following  dynamic  programming  for¬ 
mulation: 

infc  J  Li{^i(T,Y),Vi\p{Gi{T,p)))da^p,  dr)  (33) 
8V- 

inUMy.^.cci) 

where  K  (t)  =  p  and  r  G  [t,  t  +  A) 

In  (33),  the  function  VJ,  called  the  optimal  cost-to-go  function,  charew:terizes 
minimality  starting  from  any  arbitrary  point  inside  the  current  interval.  The 
second  equation  is  the  corresponding  Hamilton- Jacobi-Bellman  equation  for  the 
problem  stated  in  (24)  and  (25)  where  H  is  the  Hamiltonian  of  the  relaxed  prob¬ 
lem.  This  formulation  provides  the  formal  coupling  between  deductive  theorem 
proving  and  optimal  control  theory.  The  inferencer  allows  the  real-time  opti¬ 
mal  solution  of  the  formal  control  problem  resulting  in  intelligent  distributed 
real-time  control  of  the  multiple-agent  system.  The  central  idea  for  inferring  a 
solution  to  (33)  is  to  expand  the  cost-to-go  function  V{., .)  in  a  rational  power 
series  V  in  the  algebra: 

D{{{Y,t)))  (34) 

Replacing  V  for  V}  in  the  second  equation  in  (33),  gives  two  items:  a  set  of 
polynomic  equations  for  the  coefficients  of  V  and  a  partial  order  expression 
for  representing  the  optimality.  Because  of  convexity  and  rationality  of  V,  the 
number  of  equations  to  characterize  the  coefficients  of  V  is  finite.  The  resulting 
string  of  conjunctions  of  coefficient  equations  and  the  optimality  partial  order 
expression  are  in  the  form  of  (26). 

In  summary,  for  each  agent,  the  inferencer  operates  according  to  the  follow¬ 
ing  procedure. 


Vi{Y,T)  = 
dr 
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step  1:  Load  current  theorem  (24),  (25). 

Step  2:  Transform  theorem  to  equational  form  (26)  via  (33). 

Step  3:  Execute  proof  according  to  Figure  8. 

If  the  theorem  logically  follows  from  the  Knowledge  Base  (i.e.,  it  is  true), 
the  inferencer  procedure  will  terminate  on  step  3  with  actions  .  K  the  theorem 
does  not  logically  follow  from  the  Knowledge  Base,  the  Adapter  is  activated, 
and  the  theorem  is  modified  by  the  theorem  Planner  according  to  the  strategy 
outlined  above.  This  mechanism  is  the  essence  of  reactivity  in  the  agent.  Be¬ 
cause  of  relaxation  and  convexity,  this  mechanism  ensures  that  the  estimatable 
set  of  the  domain  is  strictly  larger  than  the  mechanism  without  this  correction 
strategy. 


3,1,5  Knowledge  Decoder; 

The  function  of  the  Knowledge  Decoder  is  to  translate  knowledge  data  from 
the  network  into  the  agent’s  Knowledge  Base  by  updating  the  inter-  agent  speci¬ 
fication  clauses.  These  clauses  characterize  the  second  constraint  in  (33).  Specif¬ 
ically,  they  express  the  constraints  imposed  by  the  rest  of  the  network  on  each 
agent.  They  also  characterize  the  global- to-local  trginsformations  (see  [20]). 
Finally,  they  provide  the  rules  for  building  generalized  multipliers  for  incor¬ 
porating  the  inter-agent  constraints  into  a  complete  unconstrained  criterion, 
which  is  then  used  to  build  the  cost-to-go  function  in  the  first  expression  in 
(33).  A  generalized  multiplier  is  an  operator  that  transforms  a  constrain  into  a 
potential  term.  This  potential  is  then  incorporated  into  the  original  Estimation 
Lagrangian  of  the  agent  which  now  2tccounts  explicitly  for  the  constraint. 

The  Knowledge  Decoder  has  a  built-in  inferencer  used  to  infer  the  structure 
of  the  multiplier  and  transformations  by  a  procedure  similar  to  the  one  de¬ 
scribed  for  (14).  Specifically,  the  mxiltiplier  and  transformations  are  expanded 
in  a  rational  power  series  in  the  algebra  defined  in  (34).  Then  the  necessary  con¬ 
ditions  for  duality  are  used  to  detennine  the  conjunctions  of  equational  forms 
and  a  partied  order  expression  needed  to  construct  a  theorem  of  the  form  of  (26) 
whose  proof  generates  a  multiplier  for  adjoining  the  constraint  to  the  Estimation 
Lagrangian  of  the  agent  as  another  potential. 

The  conjunction  of  equational  forms  for  each  global-to-local  transformation 
is  constructed  by  applying  the  following  invariant  embedding  principle; 

For  each  agent,  the  actions  at  given  time  t  in  the  current  interval, 
as  computed  according  to  (33),  are  the  same  actions  computed  at  t 
when  the  formulation  is  expanded  to  include  the  previous,  current, 
and  next  intervals. 

By  transitivity  and  convexity  of  the  criterion,  the  principle  can  be  analyti¬ 
cally  extended  to  the  entire  horizon.  The  invariant  embedding  equation  has  the 
same  structure  as  the  dynamic  programming  equation  given  in  (33),  but  with 
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Figure  9:  The  Companion  Agent 


the  global  criterion  and  globed  Hamiltonians  instead  of  the  corresponding  local 
ones. 

The  local-tcHglobal  transformations  are  obtained  by  inverting  the  global-to- 
local  transformations,  obtained  by  expressing  the  invariant  embedding  equation 
as  an  equational  theorem  of  the  form  of  (26).  These  inverses  exist  because  of 
convexity  of  the  relaxed  Lagrangian  and  the  rationality  of  the  power  series. 

It  is  important  at  this  point  to  interpret  the  functionality  of  the  Knowledge 
Decoder  of  each  agent  in  terms  of  what  it  does.  The  multiplier  described  above 
has  the  effect  of  aggregating  the  rest  of  the  system  and  the  other  agents  into  an 
equivalent  companion  system  and  companion  agent,  respectively,  as  viewed  by 
the  current  agent.  This  is  illustrated  in  Figure  9. 

The  aggregation  model  (Figure  9)  describes  how  each  agent  perceives  the 
rest  of  the  network.  This  unique  feature  allows  us  to  characterize  the  scalability 
of  the  architecture  in  a  unique  manner.  Namely  in  order  to  determine  compu¬ 
tational  complexity  of  an  application,  we  have  only  to  consider  the  agent  with 
the  highest  complexity  (i.e.,  the  local  agent  with  the  most  complex  criterion) 
and  its  companion. 

4  Constructing  Estimation  Lagrangians. 

As  described  in  the  previous  section,  the  Planner  module  of  each  agent  A  con¬ 
structs  an  Estimation  Lagrangian  L{t,x,x)  such  that  the  desired  evolution  of 
the  system  ocemrs  when  the  system  evolves  along  a  curve  x  =  C{t)  which  mini- 
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mizes 


(35) 


where  z(to)  =  and  x(io + A)  =  xi .  Here  in  local  coordinates,  x  =  (x^ . . . ,  x’^) 
and  X  =  (x^, . . .  ,x^)* 

We  would  like  to  modify  the  agent’s  Estimation  Lagrangian  by  adding  a 
divergence  free  function  to  construct  a  Estimation  Lagrangian  with  a  special 
property  that  when  we  reach  our  desired  estimation  for  the  system  that  the 
resulting  change  in  the  estimation  due  to  the  flow  is  0.  That  is,  suppose  that 
5(t,x)  is  a  smooth  function  {C^  is  enough).  Then 


dt 


dS  A  95  .. 


i=i 


(36) 


Now  suppose  that  we  replace  the  agent’s  original  Lagrangian  by 


j  .  ds 

(37) 

j=i 

For  any  curve  C{t)  with  C(to)  =  and  C{to  +  A)  =  xi,  let 

J(C)  =  /  Lit,Cit),C{t))dt 

Jtn 

(38) 

rto+A 

J{C)  =  /  L{t,C{t),C{t))dt. 

Jto 

(39) 

Then  the  fact  that  the  difference  between  L  and  L  is  a  divergence  free  function 
^  implies 

J(C)  -  J{C)  =  S{t  +  A,  xi)  5(t,  xo).  (40) 

Thus  value  of  the  left  hand  side  of  (40)  does  not  depend  on  C  so  that  that  the 
curve  C*  which  minimizes  J{C)  also  minimizes  J(C). 

We  want  to  construct  a  geodesic  field  for  a  given  agent  Ai  =  A 

=  ^‘(t,  x)  for  i  =  1, . . . ,  n  (41) 

on  the  manifold  M  such  that 

L(t,x,x)  =  0  if  =  ^(t,x)  for  j  =  1, ...  ,n  (42) 

L(t,x,x)  >  0  if  otherwise.  (43) 

Note  that  this  a  very  strong  condition  which  would  ensure  that  evolving  the 
system  along  a  geodesic  would  result  in  no  change  to  our  estimate.  See  (18). 

In  this  section  we  shall  explore  the  conditions  that  such  an  5  must  satisfy 
and  how  we  can  construct  the  desired  geodesic  field. 
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To  this  end,  we  note  that  if  there  exists  a  curve  x  =  C(t)  which  minimizes 


yto 

Jto 


L(t,  X,  x)dt 


(44) 


where  x(to)  =  2:0  and  x{to  +  A)  =  xi,  then  L  must  satisfy  the  Euler-Lagrange 
equations.  That  is,  let  Ej  be  the  operator 

^  dVdxi’  dxi 

for  j  =  1, . . . ,  n.  Then  L  must  satisfy  the  equations 

Ej{L)  =  0  for  j  =  1, . . . , n.  (45) 

This  given,  it  will  be  useful  for  subsequent  formulas  to  rewrite  (45)  in  terms  * 
of  the  canonical  momentum  jp  and  the  Hamiltonian  of  the  system  H.  That  is, 
define 

=  ^it,x,x)i0T  j  =  (46) 


We  shall  assume  that 


(47) 


for  all  t,x,  X.  Given  (47),  we  can  use  the  Inverse  Function  theorem  to  solve  for 
the  x^’s  in  terms  of  i,  t,  and  p.  That  is,  there  are  functions  V  such  that 


,p)  j  =  l,...,n. 

(48) 

We  then  define  the  Hamiltonian  H  of  the  system  by 

n 

H[t,x,p)  =  -L{t,x,x)  + 

i=i 

(49) 

It  then  is  easily  verified  that 

dH 

hi  —  xi, 

dpi 

(50) 

dH 

dL 

dxi 

(51) 

dH 

dL 

(52) 

dt 

dt  * 

It  follows  that  the  Euler-Lagrange  equations  are  equivalent  to  the  following. 

(53) 

It  follows  from  (42)  and  (43)  that 

Z(^,I^...,x",V>^...,V’")  =  miniL{t,x,x).  (54) 


+  =  * . "■ 
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Thus  the  solution  to  (41)  must  satisfy 


«  dL 
°  “  dxi 

dL  dS 
dx^  dx^ 
dS 

= 

Thus  when  =  V(i,x,p)  =  t/j^(t,x)  for  j  =  1, . . .  ,n,  then 

dS 

Pj{t,x,x)  = 

so  that  Pj  is  a  function  of  just  t  and  x.  Moreover  it  must  be  the  case  that 
0  =  L(t,x,x) 

as 


=  L{t,  x,x)  -  —  -  p>h^ 
i=i 


at  . 

j=i 

^  " 
at 


axi 


and  hence  5  satisfies 


as  as.  „ 


(55) 


We  claim  that  if  the  (41),  (42),  and  (43)  hold  and  5(t,x)  is  a  function  such 
that  (55)  holds  and 

then  the  Weierstrass  condition  for  the  existence  of  a  curve  C{t)  which  is  a 
strong  minimum  for  (44)  must  also  hold.  It  then  follows  that  we  can  construct 
an  e-approximation  to  the  geodesic  fields  via  the  techniques  of  [23]. 

To  make  our  statement  precise,  we  make  the  following  definition,  see  [7]. 

Definition  4,1  Suppose  that  x  =  (xi, . . .  ,Xn),  y  =  (yi, . . .  ,yn),  and  we  a  give 
a  function  F  ofZn  variables.  Then  the  Weierstrass  E-function  of  the  functional 


•^[y]  =  [  F{x,y,y)dx,  y{a)  =  A,  y{b)  =  B 
J  a 

the  function  o/  3n  -h  1  variables: 

E{x,  y,z,w)  =  Fix, y, w)  -  F{x,  y,z)  -  z*) 


t=l 


(57) 


(58) 


Then  the  following  result  is  proved  in  [7]. 
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Theorem  4.2  Let  7  6e  a  curve  which  is  extremal  for  the  functional  J[j/]  of  (57) 
and  let 

y*  =  /ori  =  (59) 

be  a  field  for  the  functional  J[y]  of  (57).  Suppose  that  at  every  point  (i,y)  of 
some  (open)  region  containing  7  and  covered  by  the  field  (59),  the  condition 


E{x,y,rp,'w)  >  0 


(60) 


is  satisfied  for  every  finite  vector  w  =  {ufi- , . . . ,  vf*) .  Then  J[y]  has  a  strong 
minimum  for  the  extremal  7. 


Now  under  our  assumptions, 

xJ  1=  h^{t,x,p) 

=  h^{t,x,^) 

=  V^'(t,i)  for  i  =  1,. 

Now  take  ^  of  both  sides  of  (56),  we  get 

dpi  _  d^S  A  d^S 

dt  dtdx^  dx^dx^ 

Jc=l 


(61) 


.X*. 


(62) 


Then  taMng  the  partial  derivative  with  respect  to  x^  of  both  sides  of  (55),  we 
get 

d^S  dH  ^  dH  d^S 

dtdx^  dx^  "  dp^  dx^dx^ 

However  since  x^  =  for  j  =  1, ...  n,  we  can  rewrite  (63)  as 


I  ,  ■f 

dtdx^  dx^  “  dxWx^ 
*=1 


(64) 


(65) 


Combining  (62)  and  (64),  we  get  that 

djp  ^  dH 
dt  dx^ 

for  j  =  1, . .  .n.  Note  that  this  implies  the  system  can  be  cheiracterized  by  the 
equations 

dpi 


and 


dH  . 
dxi  ^ 


(66) 

=  l,...,n. 

(67) 
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Next  observe  that  conditions  (42)  and  (43)  imply  that 


dt  dx3  f-f  dx3 

j=i 


with  equality  holding  only  if 


Then  (55)  can  be  rewritten  as 


=  ^*?(t,x). 


Solving  for  ^  in  (69)  and  substituting  into  (68),  we  set  that 


Lit, X,  x)  -  Lit,  x,7p)  +  ^  “  ^)  >  0 


Thus  since  ^  we  can  derive  that 

L{t,x,x)-L{t,x,i})  +  '^^{x^  -tip)  >0 


Next  define  the  Weierstrass  function  E  by 

QC 

E{t,x,tp,x)  =  L{t,x,x)  -  L{t,x,tl})  +  Y^—{x^ 


Then  (71)  says  that 

£?(t,x,V^,x)  >  0.  (73) 

Moreover  by  applying  Mean  Value  Theorem  twice,  we  get  that 

1  _  r 

E{t,x,tp,x)  =  ■j_'^-QT^{t,x,z){3p  -iP){x'‘  -t!)’’)  (74) 

hk 

for  some  z  =  9x  +  il  —  with  0  <  5  <  1.  Thus  if  jB(t,  x,  tp,  x)  >  0,  it  must  be 
that 

5  Agent  Synchronization 

Agent  synchronization  is  based  on  the  following  result. 


29 


Theorem  5.1  For  any  agents  Aq  and  Ai,  let  Lq  and  Li  be  the  corresponding 
agent  Estimation  Lagrangian  where  for  each  i,  Li  is  a  function  of  the  state 
Xij  Xi,  and  the  agent  clock  time  t.  We  can  explicitly  construct  state  and  clock 
transformation  functions  $2^  and  from  the  Noether  invariance  relations 
such  that  for  any  given  time  interval  I  and  e  >  0, 

J [Loi^o^io^to)  —  Li{^^^{xo^to),^l^{xoyto),^^^{xo^to))]dtQ  <  e.  (75) 

The  significance  of  this  result  is  that  one  can  explicitly  construct  transfer- 
mation  function  to  coherently  fuse  sensor  data.  That  is,  agents  Ao  and  Ai  may 
have  different  models  of  the  plant.  Thus  for  agents  Ao  and  Ai  to  be  able  to 
communicate  with  each  other,  we  need  to  construct  transformations  functions 
which  allow  agent  Aq  to  interprete  agent  Ai’s  state  estimation.  The  theorem  • 
says  that  such  transformation  functions  not  only  exist  but  can  be  computed  for 
the  Noether  invariance  relations.  Moreover  if  agent  Aq  realizes  that  its  state  is 
incompatible  with  the  state  information  given  by  agent  Ai,  agent  Aq  goes  to 
the  adaptation  loop  to  correct  the  incompatibility. 

To  illustrate  Theorem  5.1,  suppose  that  there  is  a  global  state  estimation 
function  for  the  plant  L{t,  x,  x)  and  we  have  two  agents,  Agent  1  with  its  Esti¬ 
mation  Lagrangian  Li(ti,xi,xi)  and  Agent  2  with  its  state  estimation  function 
L2{t2,^2,X2)^  Moreover  assume  that  we  have  state  and  clock  transition  func¬ 
tions  for  Agent  t,  i  =  1,2,  given  by 

x{  =  for  ;  =  l,...,n  (76) 

ti  =  ti{t,x,w)  (77) 

where  w  =  (u;^, . . .  ,ti;”)  is  a  set  of  parameters  for  the  transformation.  That  is, 
we  assume 

Li{xi{t,x,w),xi{t,x,w),xi{t,x,w))  =L(t,x,x)  (78) 

for  i  =  1,2. 

Next  we  shall  state  the  Noether  Invariance  relations  referred  to  in  Theorem 
5.1.  We  first  need  some  notation.  First  we  define  two  classes  of  infinitesimal 
transformations  to  each  Agent  i. 


CJXT' 

=  (79) 

2ind 

dti  , 

dyjk  l®=0  “ 

Next  let  Eij,  =  1, . . .  ,n,  be  the  Euler-Lagrange  operators  for  Li.  That  is,  let 


This  given,  the  Noether  Invariance  Relations  are  given  by 


n 


i=i 


(82) 


for  A:  =  1, . . . ,  n 

The  idea  is  that  if  the  state  estimation  Lagrangian  Li  has  an  extremal,  i.e. 
if  there  exists  a  curve  Xi{t)  =  Ci{t)  which  minimizes 


j,  Xt,  Xi)dti 


(83) 


where  x(to)  =  d-  A)  =  xi,  then  L  must  satisfy  the  Euler-Lagrainge 

equations  Eij{Li)  =  0  for  j  =  1, . . .  ,ni.  In  that  case  the  left  hand  side  of  the  ^ 
equations  in  (82)  are  0  and  hence  there  are  constants  d^k  such  that 


LiTi,k  +  ^  =  Ci.*  (84) 

j=l 

for  A;  =  1, . . . ,n.  There  are  two  important  aspects  about  (84).  First  note  that 
given  the  constants  Ci,ki  we  can  solve  for  the  infinitesimals  Ti^k  and  and 

integrate  to  recover  the  desired  transformation  x^  and  i*.  We  will  illustrate  this 
type  of  calculation  with  a  simple  example  below.  Second,  note  that  we  can 
monitor  the  failure  of  synchronization  of  the  state  estimation  Lagrangians  by 
simply  observing  that  the  left  hand  side  of  (84)  is  not  a  constant.  If  the  left  hand 
side  of  (84)  is  not  a  constant,  then  we  know  that  the  current  agent  Lagrangian 
is  not  compatible  with  the  system  Lagrangian  so  that  Agent  i  would  use  the 
Adapter  to  reconstruct  his  Estimation  Lagrangian. 


Example  5.1  A  simplified  range  model  of  radar  returns  has  the  following  sys¬ 
tem  Lagrangian. 

i^  =  t^(y-y).  (85) 

Now  suppose  that  Agent  1  has  infinitesimal  transformations 


ti=t  +  T{x,t)w  (86) 

xi  =  X  H-  0(x,  t)w  (87) 

We  assume  that  x  =  x(t)  satisfies  the  Euler-Lagrange  equation  so  that 

_  d  /dL.  dL 
dt^dx^  dx 
=  2tx  +  t^x  —  t^x^. 


Thus 


t^x  =  t^x^  -  2fx 


(88) 
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Then  from  (82) ,  we  have  that 


d  r-  /A  ‘  1 

^[Lr+-^(9-xr]  = 


ilt^±e-^^  +  ‘^)r]  =  o 

Expanding  the  derivative  in  (89),  using  (88)  to  replace  t^x,  and  collecting  terms, 
we  obtain  the  following  relations. 


: 

(89) 

X^ 

,89  1=i'i'8t_ 

^  dt  6  6  dx 

(90) 

x^ 

,  ^de  t^dr  . 

(91) 

t^dr  „ 

■  2  dx~ 

(92) 

Note  that  by  (92),  r  =  r(t)  is  just  a  function  oft.  Also  (92)  and  (90)  imply  that 
H  =  0  50  that  9  =  6{x)  is  just  a  function  of  x.  This  means  that  (89)  and  (91) 
are  just  ordinary  differential  equations  which  one  can  easily  solve  via  standard 
power  series  methods  to  show  that 

T  =  at  (93) 

e  =  -|i.  (94) 

where  a  is  an  arbitrary  constant  Hence 

=:  t-h  atw  (95) 

a 

=  X  —  -rXW,  (96) 

Of  course,  the  constant  a  can  be  determined  from  initial  conditions. 

6  Conclusions 

We  have  shown  that  our  MAHEA  architecture  is  an  effective  mechanism  to  solve 
various  problems  of  estimating  a  process  in  which  the  data  available  is  dynamic, 
noisy,  and  given  in  a  multiplicity  of  representations.  A  MAHEA  agent  network 
for  estimating  plant  state  is  an  efficient  mechanism  for  state  estimation  which  is 
extensible,  robust,  scalable,  allows  cross-checking,  and  supports  heterogeneous 
information. 

Deployment  of  an  agent  based  system  is  very  simple.  As  soon  as  a  new 
source  of  information  is  available,  a  new  estimation  agent  is  spawned  whose 
Knowledge  Base  is  a  model  of  the  plant  covered  by  that  source  (extensibility). 
Moreover  no  common  representation  of  the  data  is  required  so  that  the  system 
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supports  heterogeneous  information  sources.  Thus  our  architecture  eJlows  us  to 
incorporate  existing  models  and  estimation  techniques.  The  built-in  invariance 
condition  tests  for  the  validity  of  the  data  (cross-checking). 

We  have  shown  that  the  existing  theory  of  hybrid  systems  and  relaxed  vari¬ 
ation  optimization  can  be  adapted  to  the  state  estimation  problem  for  the  plant 
by  constructing  a  Lagrangian  which  becomes  0  at  points  which  correspond  to 
consistent  estimates  of  the  plant  and  is  positive  at  points  which  are  not  consis¬ 
tent.  This  property  ensures  that  when  an  agent  reaches  consistent  estimates  of 
the  plant,  the  evolution  of  the  system  produced  by  the  flow  of  the  corresponding 
geodesic  field  is  adapted  to  the  current  information  of  the  plant  as  viewed  by 
each  agent. 

Another  key  result  for  agent  synchronization  is  a  theorem  which  we  call  the 
Theveninin  Theorem  which  states  in  a  network  with  many  agents,  an  individual 
agent  A  can  view  the  rest  of  the  sensor  agents  as  a  single  aggreagated  agent  • 
C(A)  called  A’s  companion  agent.  Since  an  agent  sees  the  rest  of  the  estimate 
agent  network  as  a  single  equivalent  estimation  agent,  the  architecture  maintains 
linear  complexity  even  as  more  agents  are  added.  Details  of  these  results  will 
appear  in  a  future  paper. 

Finally  the  robustness  of  the  sensor  agent’s  estimates  follows  from  the  fact 
that  each  MAHEA  agent  is  computed  by  a  finite  state  machine  which  is  Lya¬ 
punov  stable  because  the  mapping  induced  by  an  agent  is  a  contreiction  mapping. 

We  note  that  the  techniques  introduced  in  this  paper  for  synchronization  and 
consistency  of  state  estimates  can  be  adapted  to  solve  synchronization  problems 
for  our  Multiple  Agent  Hybrid  Control  Architecture,  Again  details  will  be  given 
a  future  paper. 
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